Wasserstein Metric

A Spectral-Grassmann Wasserstein metric for operator representations of dynamical systems

Germain, Thibaut, Flamary, Rémi, Kostic, Vladimir R., Lounici, Karim

arXiv.org Machine Learning

The geometry of dynamical systems estimated from trajectory data is a major challenge for machine learning applications. Koopman and transfer operators provide a linear representation of nonlinear dynamics through their spectral decomposition, offering a natural framework for comparison. We propose a novel approach that represents each system as a distribution over its joint operator eigenvalues and spectral projectors and defines a metric between systems via optimal transport. The proposed metric is invariant to the sampling frequency of trajectories. It is also computationally efficient, supported by finite-sample convergence guarantees, and enables the computation of Fréchet means, providing interpolation between dynamical systems. Experiments on simulated and real-world datasets show that our approach consistently outperforms standard operator-based distances in machine learning applications, including dimensionality reduction and classification, and provides meaningful interpolation between dynamical systems.

Dynamical systems are widely used across scientific and engineering disciplines to model the evolution of state variables over time (Lasota & Mackey, 2013). Nonlinear ordinary or partial differential equations typically govern these systems and may incorporate stochastic components (Meyn & Tweedie, 2012). However, in many practical situations, analytical models are unavailable or intractable, motivating the use of data-driven approaches to infer the underlying dynamics from sampled trajectories. In this context, Koopman and transfer operator regressions have emerged as a powerful framework for learning and interpreting dynamical systems from data (Brunton et al., 2022). Rather than directly modeling the evolution of state variables, these operators advance observables (scalar functions defined on the state space) by mapping each to its expected future value conditioned on the current state. Crucially, these operators are linear even when the underlying systems are nonlinear.
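
To make the construction concrete, here is a minimal Python sketch of the idea, not the authors' implementation: it estimates a Koopman matrix from trajectory snapshots by least squares, forms eigenvalues and rank-one spectral projectors, and solves a discrete optimal transport problem whose ground cost mixes an eigenvalue distance with a projector distance. The chordal projector distance and the weight alpha are illustrative stand-ins for the paper's Spectral-Grassmann construction; with uniform weights the OT problem reduces to an assignment problem.

```python
# Hypothetical sketch: compare two dynamical systems via optimal transport
# over (eigenvalue, spectral projector) pairs of estimated Koopman matrices.
import numpy as np
from scipy.optimize import linear_sum_assignment

def koopman_matrix(X):
    """Least-squares Koopman/DMD estimate from a trajectory X (T x d):
    K maps each snapshot to the next, X[t+1] ~ X[t] @ K."""
    return np.linalg.lstsq(X[:-1], X[1:], rcond=None)[0]

def eig_pairs(K):
    """Eigenvalues and rank-one spectral projectors P_i = r_i l_i^*."""
    lam, R = np.linalg.eig(K)
    L = np.linalg.inv(R).conj().T  # left eigenvectors as columns
    projs = [np.outer(R[:, i], L[:, i].conj()) / (L[:, i].conj() @ R[:, i])
             for i in range(len(lam))]
    return lam, projs

def grassmann_dist(P, Q):
    """Chordal distance between projectors (a simple stand-in for the
    paper's Grassmann metric)."""
    return np.linalg.norm(P - Q, "fro") / np.sqrt(2)

def spectral_wasserstein(X, Y, alpha=1.0):
    """OT distance between uniform distributions over (eigenvalue,
    projector) pairs; equal weights make this an assignment problem."""
    lam1, P1 = eig_pairs(koopman_matrix(X))
    lam2, P2 = eig_pairs(koopman_matrix(Y))
    C = np.array([[abs(l1 - l2) ** 2 + alpha * grassmann_dist(p1, p2) ** 2
                   for l2, p2 in zip(lam2, P2)] for l1, p1 in zip(lam1, P1)])
    rows, cols = linear_sum_assignment(C)
    return np.sqrt(C[rows, cols].mean())
```

In this simplified form both trajectories must share the same state dimension; the paper's distribution-based formulation with general OT weights relaxes that restriction.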


Augmenting Bias Detection in LLMs Using Topological Data Analysis

Varadarajan, Keshav, Songdechakraiwut, Tananun

arXiv.org Artificial Intelligence

Recently, many bias detection methods have been proposed to determine the level of bias a large language model captures. However, tests to identify which parts of a large language model are responsible for bias towards specific groups remain underdeveloped. In this study, we present a method using topological data analysis to identify which heads in GPT-2 contribute to the misrepresentation of identity groups present in the StereoSet dataset. We find that biases for particular categories, such as gender or profession, are concentrated in attention heads that act as hot spots. The metric we propose can also be used to determine which heads capture bias for a specific group within a bias category, and future work could extend this method to help de-bias large language models.
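
As a rough illustration of the ingredients, and not the paper's exact pipeline, the sketch below computes a 0-dimensional persistence summary of a single head's attention graph (via its minimum spanning tree) and scores the head by comparing summaries on stereotype versus anti-stereotype inputs. The attention matrices would come from a GPT-2 forward pass, and the L1 comparison of sorted death times is only a simple proxy for a distance between persistence diagrams.

```python
# Illustrative sketch (not the paper's exact method): score an attention
# head by comparing 0-dim persistence summaries of its attention graphs
# on stereotype vs. anti-stereotype sentences from StereoSet.
import numpy as np
from scipy.sparse.csgraph import minimum_spanning_tree

def persistence_deaths(attn):
    """0-dim persistence of a graph filtration on one head's attention
    matrix. Edge weight = 1 - symmetrized attention; connected components
    merge along the minimum spanning tree, so its edge weights are the
    death times of the 0-dim features."""
    sym = (attn + attn.T) / 2.0
    dist = 1.0 - sym
    np.fill_diagonal(dist, 0.0)
    mst = minimum_spanning_tree(dist).toarray()
    return np.sort(mst[mst > 0])

def head_bias_score(attn_stereo, attn_anti):
    """Proxy for a Wasserstein distance between the two 0-dim persistence
    diagrams: L1 distance between sorted death times."""
    d1, d2 = persistence_deaths(attn_stereo), persistence_deaths(attn_anti)
    n = min(len(d1), len(d2))
    return float(np.abs(d1[:n] - d2[:n]).sum())
```

Under this kind of scoring, heads whose summaries shift consistently across many StereoSet sentence pairs for a category would be flagged as the bias "hot spots" the abstract describes.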


Solving dynamic portfolio selection problems via score-based diffusion models

Aghapour, Ahmad, Bayraktar, Erhan, Yuan, Fengyi

arXiv.org Machine Learning

In this paper, we tackle the dynamic mean-variance portfolio selection problem in a {\it model-free} manner, based on (generative) diffusion models. We propose using data sampled from the real model $\mathbb{P}$ (which is unknown) with limited size to train a generative model $\mathbb{Q}$ (from which we can easily and adequately sample). With adaptive training and sampling methods that are tailor-made for time series data, we obtain quantification bounds between $\mathbb{P}$ and $\mathbb{Q}$ in terms of the adapted Wasserstein metric $\mathcal{AW}_2$. Importantly, the proposed adapted sampling method also facilitates {\it conditional sampling}. In the second part of this paper, we establish the stability of mean-variance portfolio optimization under $\mathcal{AW}_2$. Then, combining the error bounds with the stability result, we propose a policy gradient algorithm based on the generative environment, in which our innovative adapted sampling method provides approximate scenario generators. We illustrate the performance of our algorithm on both simulated and real data. For real data, the algorithm based on the generative environment produces portfolios that beat several important baselines, including the Markowitz portfolio, the equal-weight (naive) portfolio, and the S\&P 500.
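
For intuition, the adapted Wasserstein distance nests optimal transport: paths are matched step by step, and the cost between two first-step atoms is augmented by the transport cost between the conditional laws of the next step. Below is a minimal two-period sketch under strong simplifying assumptions (scalar paths whose first-step values are already quantized), using the POT library; it is not the paper's adapted training or sampling procedure.

```python
# Minimal sketch of the adapted Wasserstein distance AW_2 between two
# discrete two-period path measures, via nested optimal transport.
# Assumes scalar paths of shape (n_paths, 2) whose first-step values are
# quantized so that conditioning on them is meaningful. Requires POT.
import numpy as np
import ot  # Python Optimal Transport

def w2_sq_1d(x, y, n_q=200):
    """Squared 2-Wasserstein between 1-D empirical measures (quantiles)."""
    q = (np.arange(n_q) + 0.5) / n_q
    return float(np.mean((np.quantile(x, q) - np.quantile(y, q)) ** 2))

def adapted_w2(paths_p, paths_q):
    """AW_2: outer OT over first-step atoms, with a cost that adds the
    squared W2 between the conditional second-step laws."""
    xs, ys = np.unique(paths_p[:, 0]), np.unique(paths_q[:, 0])
    a = np.array([np.mean(paths_p[:, 0] == x) for x in xs])  # outer weights
    b = np.array([np.mean(paths_q[:, 0] == y) for y in ys])
    C = np.zeros((len(xs), len(ys)))
    for i, x in enumerate(xs):
        for j, y in enumerate(ys):
            inner = w2_sq_1d(paths_p[paths_p[:, 0] == x, 1],
                             paths_q[paths_q[:, 0] == y, 1])
            C[i, j] = (x - y) ** 2 + inner
    return float(np.sqrt(ot.emd2(a, b, C)))
```

The same recursion extends to more periods by nesting another conditional OT inside the inner cost, which is what makes $\mathcal{AW}_2$ sensitive to the information flow of time series rather than only their marginals.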


Quantization-based Bounds on the Wasserstein Metric

Bobrutsky, Jonathan, Moscovich, Amit

arXiv.org Machine Learning

The Wasserstein metric has become increasingly important in many machine learning applications such as generative modeling, image retrieval and domain adaptation. Despite its appeal, it is often too costly to compute. This has motivated approximation methods like entropy-regularized optimal transport, downsampling, and subsampling, which trade accuracy for computational efficiency. In this paper, we consider the challenge of computing efficient approximations to the Wasserstein metric that also serve as strict upper or lower bounds. Focusing on discrete measures on regular grids, our approach involves formulating and exactly solving a Kantorovich problem on a coarse grid using a quantized measure and specially designed cost matrix, followed by an upscaling and correction stage. This is done either in the primal or dual space to obtain valid upper and lower bounds on the Wasserstein metric of the full-resolution inputs. We evaluate our methods on the DOTmark optimal transport images benchmark, demonstrating a 10x-100x speedup compared to entropy-regularized OT while keeping the approximation error below 2%.
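
A crude version of the bounding recipe can be written in a few lines: pool each measure to a coarse grid, solve the coarse Kantorovich problem exactly, and widen the result by the worst-case quantization error via the triangle inequality. The sketch below (using the POT library) does this for W1 on 2-D grids; the paper's bounds are tighter, relying on specially designed cost matrices and an upscaling and correction stage rather than this generic argument.

```python
# Hedged sketch of quantization-based W1 bounds on 2-D grid measures:
# pool to a coarse grid, solve the coarse Kantorovich problem exactly,
# then widen by the worst-case quantization error (triangle inequality).
import numpy as np
import ot  # Python Optimal Transport

def pool2x2(img):
    """Sum-pool a (2h x 2w) nonnegative image to (h x w)."""
    h, w = img.shape[0] // 2, img.shape[1] // 2
    return img.reshape(h, 2, w, 2).sum(axis=(1, 3))

def w1_bounds(p, q):
    """(lower, upper) bounds on W1(p, q) for same-shape, even-sized grid
    measures with unit pixel spacing and Euclidean ground cost."""
    pc, qc = pool2x2(p / p.sum()), pool2x2(q / q.sum())
    h, w = pc.shape
    # Cell centers expressed in original pixel coordinates.
    yy, xx = np.meshgrid(np.arange(h) * 2 + 0.5, np.arange(w) * 2 + 0.5,
                         indexing="ij")
    pts = np.stack([yy.ravel(), xx.ravel()], axis=1)
    M = ot.dist(pts, pts, metric="euclidean")
    coarse = ot.emd2(pc.ravel(), qc.ravel(), M)
    # Pooling moves each pixel's mass at most half a cell diagonal, so
    # |W1(p, q) - W1(pc, qc)| <= 2 * r by the triangle inequality.
    r = np.sqrt(2) / 2
    return max(coarse - 2 * r, 0.0), coarse + 2 * r
```

Because the coarse problem has 4x fewer atoms per pooling level, the exact solve is drastically cheaper than at full resolution, which is the source of the speedups the abstract reports.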


Learn A Flexible Exploration Model for Parameterized Action Markov Decision Processes

Wang, Zijian, Wang, Bin, Shao, Mingwen, Dou, Hongbo, Tao, Boxiang

arXiv.org Artificial Intelligence

Hybrid action models are widely considered an effective approach to reinforcement learning (RL) modeling. The current mainstream method is to train agents under Parameterized Action Markov Decision Processes (PAMDPs), which performs well in specific environments. Unfortunately, these models either exhibit drastically low learning efficiency in complex PAMDPs or lose crucial information in the conversion between the raw space and a latent space. To enhance the learning efficiency and asymptotic performance of the agent, we propose a model-based RL (MBRL) algorithm, FLEXplore. FLEXplore learns a parameterized-action-conditioned dynamics model and employs a modified Model Predictive Path Integral control. Unlike conventional MBRL algorithms, we carefully design the dynamics loss function and reward smoothing process to learn a loose yet flexible model. Additionally, we use the variational lower bound to maximize the mutual information between the state and the hybrid action, enhancing the exploration effectiveness of the agent. We theoretically demonstrate that FLEXplore can reduce the regret of the rollout trajectory through the Wasserstein metric under given Lipschitz conditions. Our empirical results on several standard benchmarks show that FLEXplore has outstanding learning efficiency and asymptotic performance compared to other baselines.
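
For readers unfamiliar with the control component, here is a generic Model Predictive Path Integral (MPPI) planning step on a learned model. It is not FLEXplore's modified variant (the loose-model loss, reward smoothing, and mutual-information exploration bonus are not reproduced), and dynamics and reward are placeholder callables.

```python
# Generic MPPI sketch (not FLEXplore's modified variant): roll out noisy
# action sequences through a learned dynamics model and reweight them by
# exponentiated return. `dynamics` and `reward` are placeholder callables.
import numpy as np

def mppi(state, dynamics, reward, horizon=15, n_samples=256,
         action_dim=2, sigma=0.3, temperature=1.0, u_init=None):
    """One MPPI planning step; returns the first action and the new plan."""
    u = np.zeros((horizon, action_dim)) if u_init is None else u_init
    noise = np.random.randn(n_samples, horizon, action_dim) * sigma
    returns = np.zeros(n_samples)
    for k in range(n_samples):
        s = state
        for t in range(horizon):
            a = u[t] + noise[k, t]
            returns[k] += reward(s, a)
            s = dynamics(s, a)  # learned model predicts the next state
    # Softmax weights over trajectories (information-theoretic update).
    w = np.exp((returns - returns.max()) / temperature)
    w /= w.sum()
    u_new = u + np.tensordot(w, noise, axes=1)  # weighted noise average
    return u_new[0], u_new
```

In a PAMDP setting, the sampled actions would be hybrid, pairing a discrete action type with its continuous parameters, which is where FLEXplore's parameterized-action-conditioned dynamics model comes in.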